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Abstract 

Results on the behaviour of the rightmost particle in the nth generation 
in the branching random walk are reviewed and the phenomenon of 
anomalous spreading speeds, noticed recently in related deterministic 
models, is considered. The relationship between such results and certain 
coupled reaction-diffusion equations is indicated. 
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1 Introduction 



I arrived at the University of Oxford in the autumn of 1973 for post- 
graduate study. My intention at that point was to work in Statistics. 
The first year of study was a mixture of taught courses and designated 
reading on three areas (Statistics, Probability, and Functional Analysis, 
in my case) in the ratio 2:1:1 and a dissertation on the main area. As part 
of the Probability component, I attended a graduate course that was an 



exposition, by its author, of the material in Hammersley (1974), which 



had grown out of his contribution to the discussion of John's invited 



paper on subadditive ergodic theory (Kingman 1973). A key point of 



Hammersley's contribution was that the postulates used did not cover 
the time to the first birth in the nth generation in a Bellman-Harris 
processpl [Hammersley (1974) showed, among other things, that these 
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quantities did indeed exhibit the anticipated limit behaviour in probab- 
ility. I decided not to be examined on this course, which was I believe a 
wise decision, but I was intrigued by the material. That interest turned 
out to be critical a few months later. 



By the end of the academic year I had concluded that I wanted to 
pursue research in Probability rather than Statistics and asked to have 
John as supervisor. He agreed. Some time later we met and he asked 
me whether I had any particular interests already — I mentioned Ham- 
mersley's lectures. When I met him he was in the middle of preparing 
something (which I could see, but not read upside down). He had what 
seemed to be a pile of written pages, a part written page and a pile 
of blank paper. There was nothing else on the desk. A few days later 



a photocopy of a handwritten version of Kingman (1975), essentially 



identical to the published version, appeared in my pigeon-hole with the 
annotation "the multitype version is an obvious problem" — I am sure 
this document was what he was writing when I saw him. (Like all re- 
miniscences, this what I recall, but it is not necessarily what happened.) 
This set me going. For the next two years, it was a privilege to have John 
as my thesis supervisor. He supplied exactly what I needed at the time: 
an initial sense of direction, a strong encouragement to independence, 
an occasional nudge on the tiller about what did or did not seem tract- 
able, the discipline of explaining orally what I had done, and a ready 
source on what was known, and where to look for it. However, though 
important, none of these get to the heart of the matter, which is that 
I am particularly grateful to have had that period of contact with, and 
opportunity to appreciate first-hand, such a gifted mathematician. 



Kingman ( 1975 ) considered the problem Hammersley had raised in 



its own right, rather than as an example of, and adjunct to, the general 
theory of subadditive processes. Here, I will say something about some 
recent significant developments on the first-birth problem. I will also go 
back to my beginnings, by outlining something new about the multitype 
version that concerns the phenomenon of 'anomalous spreading speeds'. 



which was noted in a related context in jWeinberger et al. (2007). Certain 
martingales were deployed in Kingman ( 1975 ). These have been a fruitful 



topic in their own right, and have probably received more attention since 



then than the first-birth problem itself (see Alsmeyer and Iksanov ( 2009 ) 



for a recent nice contribution on when these martingales are integrable). 
However, those developments will be ignored here. 
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2 The basic model 

The branching random walk (BRW) starts with a single particle loc- 
ated at the origin. This particle produces daughter particles, which 
are scattered in M, to give the first generation. These first generation 
particles produce their own daughter particles similarly to give the sec- 
ond generation, and so on. Formally, each family is described by the 
collection of points in M giving the positions of the daughters relative to 
the parent. Multiple points are allowed, so that in a family there may be 
several daughter particles born in the same place. As usual in branching 
processes, the nth generation particles reproduce independently of each 
other. The process is assumed supercritical, so that the expected family 
size exceeds one (but need not be finite — indeed even the family size it- 
self need not be finite). Let P and E be the probability and expectation 
for this process and let Z be the generic reproduction process of points 
in K. Thus, EZ is the intensity measure of Z and Z{M) is the family 
size, which will also be written as N . The assumption that the process 
is supercritical becomes that EZ(IR) — E7V > 1. To avoid burdening the 
description with qualifications about the survival set, let P(iV 0) = 0, 
so that the process survives almost surely. 

The model includes several others. One is when each daughter receives 
an independent displacement, another is when all daughters receive the 
same displacement, with the distribution of the displacement being in- 
dependent of family size in both cases. These will be called the BRW 
with independent and common displacements respectively. Obviously, 
in both of these any line of descent follows a trajectory of a random 
walk. (It is possible to consider an intermediate case, where displace- 
ments have these properties conditional on family size, but that is not 
often done.) Since family size and displacements are independent, these 
two processes can be coupled in a way that shows that results for one 
will readily yield results for the other. In a common displacement BRW 
imagine each particle occupying the (common) position of its family. 
Then the process becomes an independent displacement BRW, with a 
random origin given by the displacement of the first family, and its nth 
generation occupies the same positions as the (n-|- l)th generation in the 
original common displacement BRW. Really this just treats each family 
as a single particle. 

In a different direction, the points of Z can be confined to (0,oo) 
and interpreted as the mother's age at the birth of that daughter: the 



framework adopted in Kingman (19751. Then the process is the general 
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branching process associated with the names of Ryan, Crump, Mode 
and Jagers. Finally, when all daughters receive the same positive dis- 
placement with a distribution independent of family size the process is 



the Bellman-Harris branching process: the framework adopted in Ham- 



mersley (1974). 



There are other 'traditions', which consider the BRW but introduce 
and describe it rather differently and usually with other problems in 
focus. There is a long tradition phrased in terms of 'multiplicative cas- 
cades' (see for example Liu (20001 and the references there) and a rather 
shorter one phrased in terms of 'weighted branching' (see for example 



Alsmeyer and Rosier (20061 and the references there). The model has 



arisen in one form or another in a variety of areas. The most obvious 
is as a model for a population spreading through an homogeneous hab- 
itat. It has also arisen in modelling random fractals (Peyriere 2000) 
commonly in the language of multiplicative cascades, in the theoretical 
study of algorithms (Mahmoud 1992), in a problem in group theory 
( Abert and Virag 2005 ) and as an ersatz for both lattice-based models 
of spin glasses in physics (Koukiou 19971 and a number theory problem 
(Lagarias and Weiss 1992 1. 



3 Spreading out: old results 

Let Z*^") be the positions occupied by the nth generation and i?^"' its 
rightmost point, so that 

= sup{z : z a point of Z*")}. 

One can equally well consider the leftmost particle, and the earliest 
studies did that. Reflection of the whole process around the origin shows 
the two are equivalent: all discussion here will be expressed in terms of 
the rightmost particle. The first result, stated in a moment, concerns 
/n converging to a constant, F, which can reasonably be interpreted 
as the speed of spread in the positive direction. 

A critical role in the theory is played by the Laplace transform of 
the intensity measure EZ: let k((/)) = log / e'^^'EZ{dz) for (j) > and 
k{4>) — oo for < 0. It is easy to see that when this is finite for some 
(j) > the intensity measures of Z and Z(") are finite on bounded sets, 
and decay exponentially in their right tail. The behaviour of the leftmost 
particle is governed by the behaviour of the transform for negative values 
of its argument. The definition of k discards these, which simplifies later 
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formulations by automatically keeping attention on the right tail. In 
order to give one of the key formulae for T and for later explanation, let 
K* be the Fenchel dual of k, which is the convex function given by 



K*(a) = sup{0a — 



(3.1) 



This is sufficient notation to give the first result. 
Theorem 3.1 When there is a cj) > such that 

K{<j>) < OO, 

there is a constant T such that 

> r a.s. 



(3.2) 



(3.3) 



and r = sup{a : k* (a) < 0} = inf{K(6l)/6' : 0}. 
This result was proved for the common BRW with only negative dis- 



placements with convergence in probability in Hammersley ( 1974 The- 



orem 2). It was proved in Kingman ( 1975 Theorem 5) for Z concentrated 



on (—00,0) and with < k{4>) < oo instead of (3.2). The result stated 



above is contained in Biggins] (|1976a Theorem 4) , which covers the ir- 



reducible multitype case also, of which more later. The second of the 
formulae for F is certainly well-known but cannot be found in the pa- 
pers mentioned — I am not sure where it first occurs. It is not hard to 
establish from the first one using the definition and properties of k* . 

The developments described here draw on features of transform the- 
ory, to give properties of k, and of convexity theory, to give properties 
of K* and the speed F. There are many presentations of, and notations 
for, these, tailored to the particular problem under consideration. In this 
review, results will simply be asserted. The first of these provides a con- 
text for the next theorem and aids interpretation of sup{a : k* (a) < 0} 
in the previous one. It is that when k is finite somewhere on (0, oo), k* 
is an increasing, convex function, which is continuous from the left, with 
minimum value —k(0) = — logEiV, which is less than zero. 

A slight change in focus derives Theorem |3 . 1 1 from the asymptotics of 
the numbers of particles in suitable half- infinite intervals. As part of the 
derivation of this the asymptotics of the expected numbers are obtained. 



Specifically, it is shown that when (3.2) holds 



1 log (ez(") [na, oo)) -k* (a) 
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(except, possibly, at one a). The trivial observation that when the ex- 
pectation of integer-valued variables decays geometrically the variables 
themselves must ultimately be zero implies that log Z^") [na, oo) is ulti- 
mately infinite on {a : K*(a) > 0}. This motivates introducing a notation 
for sweeping positive values of k* , and later other functions, to infinity 
and so we let 

fo/„^ _ / /(a) wl^cn /(a) < 

^ " \ oo when /(a) > ^ ' 

and K* = {k*)° . The next result can be construed as saying that in crude 
asymptotic terms this is the only way actual numbers differ from their 
expectation. 



Theorem 3.2 When ([3^ holds, 

^log(z(")[na,oo)) ^ -K*(a) a.s., (3.5) 

for all a ^ T. 



From this result, which is Biggins ( 1977a Theorem 2), and the properties 



of K*, Theorem |3 . 1| follows directly. 

A closely related continuous-time model arises when the temporal 
development is a Markov branching process (Bellman-Harris with ex- 
ponential lifetimes) or even a Yule process (binary splitting too) and 
movement is Brownian, giving binary branching Brownian motion. The 
process starts with a single particle at the origin, which then moves with 
a Brownian motion with variance parameter V. This particle splits in 
two at rate A, and the two particles continue, independently, in the same 
way from the splitting point. (Any discrete skeleton of this process is a 
branching random walk.) 

Now, let B*^*^ be the position of the rightmost particle at time t. 
Then u{x,t) = P(i3(*^ < x) satisfies the (Fisher/Kolmogorov-Petrovski- 
Piscounov) equation 

du ,1 d^u , X 
which is easy to see informally by conditioning on what happens in [0, St]. 



The deep studies of Bramson ( 1978a 1983 1 show, among other things, 
that (with V = X = 1) B*^*'' converges in distribution when centred on 
its median and that median is (to 0(1)) 

V2t-i=Qlog.), 



118 



J. D. Biggins 



which imphes that T = y/2 here. For the skeleton at integer times, 
K{e) = + 1 for 6* > 0, and using Theorem 



3.1 



on this confirms that 
r = \/2. Furthermore, for later reference, note that ^F — k{9) = when 

e = V2. 

Theorem 3.1 is for discrete time, counted by generation. There are 
corresponding results for continuous time, where the reproduction is now 
governed by a random collection of points in time and space (K"*" x M). 
The first component gives the mother's age at the birth of this daughter 
and the second that daughter's position relative to her mother. Then the 
development in time of the process is that of a general branching process 
rather than the Galton-Watson development that underpins Theorem 



3.1 This extension is discussed in Biggins (1995) and Biggins (1997). 



In it particles may also move during their lifetime and then branching 
Brownian motion becomes a (very) special case. Furthermore, there are 
also natural versions of The orems |3.1 
are in rather than M — see Biggins ( 1995 



and |3.2| when particle positions 
54.2) and references there. 



4 Spreading out: first refinements 



Obviously rate-of-convergence questions follow on from (3.3). An aside 
in Biggins ( 1977b p33) noted that, typically, B^^-^—nT goes to —oo. The 



following result on this is from Biggins ( 1998 Theorem 3) , and much of it 



is contained also in Liu (1998 Lemma 7.2). When P(Z(r,oo) > 0) > 0, 



so displacements greater than F are possible, and ( |3.2| holds, there is 
a finite i? > with — «(■(?) = 0. Thus the condition here, which will 
recur in later theorems, is not restrictive. 

Theorem 4.1 // there is a finite > with dT — K{-d) ~ 0, then 

- nF -oo a.s., (4.1) 

and the condition is also necessary when P(Z(F, co) > 0) > 0. 

The theorem leaves some loose ends when P(Z(F, oo) = 0) = 1. Then 
_ fiY is a decreasing sequence, and so it does have a limit, but 



whether (4.1) holds or not is really the explosion (i.e. regularity) prob- 



lem for the general branching process: whether, with a point z from Z 
corresponding to a birth time of F — 2:, there can be an infinite number 
of births in a finite time. This is known to be complex — see Grey ( 1974) 



for example. In the simpler cases it is properties of Z{{r}), the number 
of daughters displaced by exactly F, that matters. 
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If Z({r}) is the family size of a surviving branching process (so either 
EZ({r}) > 1 or P(Z({r}) = 1) = 1) it is easy to show that -nF) 
has a finite hmit — so (4.1) fails — using embedded surviving processes 



resulting from focusing on daughters displaced by F: see Biggins ( 1976b 



Proposition II. 5. 2) or Dekking and Host ( 1991 Theorem 1). In a similar 
vein, with extra conditions, Addario-Berry and Reed (2009 Theorem 4) 
show E(i3(") — nV) is bounded. 

Suppose now that (3.2) holds. When P(Z(a,oo) = 0) = 1, simple 
properties of transforms imply that Oa — k{9) f — log EZ{{a}) as 6* f oo- 
Then, when EZ({a}) < 1 a little convexity theory shows that F < a 



and that there is a finite 1} with i^F — k{{)) = 0, so that Theorem 4.1 
applies 



This leaves the case where ( |3.2[ ) holds, P{Z{T, cxo) = 0) = 1 
and EZ({F}) = 1 but P(Z({F}) = 1) < 1, which is sometimes called, 
misleadingly in my opinion, the critical branching random walk because 
the process of daughters displaced by exactly F from their parent forms a 
critical Galton- Watson process. For this case, Bramson ( 1978b' Theorem 
1) and , Dekking and Host ( 1991[ §9) show that (4.1) holds under extra 
conditions including that displacements lie in a lattice, and that the 
convergence is at rate log log n. Bramson (1978b, Theorem 2) also gives 
conditions under which (4.1) fails. 



5 Spreading out: recent refinements 



The challenge to derive analogues for the branching random walk of the 
fine results for branching Brownian motion has been open for a long time. 



Progress was made in McDiarmid ( 1995 ) and, very recently, a nice result 



has been given in Hu and Shi ( 2009 , Theorem 1.2), under reasonably mild 



conditions. Here is its translation into the current notation. It shows that 
the numerical identifications noted in the branching Brownian motion 
case in ^are general. 

Theorem 5.1 Suppose that there is a > with i?F — K(t9) — 0, and 
that, for some e > 0, E(7V^+'') < 00, K{-d + e) < 00 and J e~'''''E,Z{dz) < 
00. Then 



lim inf 



- nF) , - nV) 

< lim sup ■ 



and 



log n 
- nF) 



logn 



logn 



— - in probability. 



a.s. 
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Good progress has also been made on the tightness of th e distributions 
of i?*^"^ when centred suitably. Here is a recent result from Bramson and 



Zeitouni (2009 Theorem 1.1) 



Theorem 5.2 Suppose the BRW has independent or common displace- 
ments according to the random variable X. Suppose also that for some 
e > 0, E(A^^+'^) < oo and that for some ijj > and yo > 

P(X > x + y) <e-'''yp{X > x) Vx > 0,y > yo. (5.1) 

Then the distributions o/{_B''"-'} are tight when centred on their medians. 



It is worth noting that (5.1) ensures that (3.2) holds for all (j) € 



There are other results too — in particular, McDiarmid ( 1995 Theorem 



1) and Dekking and Host (1991 §3) both give tightness results for the 
(general) BRW, but with Z concentrated on a half-line. Though rather 



old for this section, Dekking and Host ( 1991 Theorem 2) is worth re- 
cording here: the authors assume the BRW is concentrated on a lattice, 
but they do not use that in the proof of this theorem. To state it, let 
D be the second largest point in Z when N > 2 and the only point 
otherwise. 

Theorem 5.3 // the points of Z are confined to (— oo,0] and E_D is 
finite, then Ei?^"^ is finite and the distributions o/l-B^"-*} are tight when 
centred on their expectations. 

The condition that ED is finite holds when J e'^^'EZ{dz) is finite in a 
neighbourhood of the origin, which is contained within the conditions in 



Theorem |5.1[ In another recent study Addario-Berry and Reed ( 2009 



Theorem 3) give the following result, which gives tightness and also 
estimates the centring. 

Theorem 5.4 Suppose that there is a d > Q with -dT — K{d) = 0, and 
that, for some e > 0, k(i9 + e) < oo and J e~'^^'EZ{dz) < oo. Suppose 
also that the BRW has a finite maximum family .size and independent 
displacements. Then 

ES(") = "r - ^ logn -f 0(1), 

and there are C > and 6 > such that 

P - EB(")| > < Ce-^"" Vx. 

The conditions in the first sentence here have been stated in a way 
that keeps them close to those in Theorem |5 . 1 1 rather than specialising 
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them for independent displacements. Now, moving from tightness to 
convergence in distribution — which cannot be expected to hold without a 
non-lattice assumption — the following result, which has quite restrictive 



conditions, is taken from Bachmann (2000 Theorem 1). 



Theorem 5.5 Suppose that the BRW has EN < oo and independent 
displacements according to a random variable with density function f 
where — log / is convex. Then the variables i?*^"^ converge in distribution 
when centred on medians. 

It is not hard to use the coupling mentioned in ^J2] to see that The- 
orems |5.4| and |5.5| imply that these two results also hold for common 
displacements. 



6 Deterministic theory 



There is another, deterministic, stream of work concerned with model- 
ling the spatial spread of populations in a homogeneous habitat, and 



closely hnked to the study of reaction-diffusion equations like (3.6). The 



main presentation is Weinberger ( 1982 ) , with a formulation that has 



much in common with that adopted in Hammersley (1974 1. Here the 



description of the framework is pared-down. This sketch draws heavily 
on Weinberger (2002), speciahsed to the homogeneous (i.e. aperiodic) 



case and one spatial dimension. The aim is to say enough to make cer- 
tain connections with the BRW. 

Let be the density of the population (or the gene frequency, 

in an alternative interpretation) at time n and position a; G M. This 
is a discrete-time theory, so there is an updating operator Q satisfy- 



(n+l) 



Q{u 



More formally, let T be the non-negative con- 
1 bounded by /3. Then Q maps J- into itself and 



mg u 

tinuous functions on 
it(") = (5(")(u(°)), where m^'^-' is the initial density and Q^"-* is the nth 
iterate of Q. The operator is to satisfy the following restrictions. The 
constant functions at and at (3 are both fixed points of Q. For any 
function u £ T that is not zero everywhere, (5'"^(it) — )■ /3, and Q{a) > a 
for non-zero constant functions in J-. (Of course, without the spatial 
component, this is all reminiscent of the basic properties of the generat- 
ing function of the family-size.) The operator Q is order-preserving, in 
that if u < V then Q{u) < Q{v), so increasing the population anywhere 
never has deleterious effects in the future; it is also translation-invariant, 
because the habitat is homogeneous, and suitably continuous. Finally, 
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every sequence Um G ^ contains a subsequence Um{i) such that Q{um[i)) 
converges uniformly on compacts. Such a Q can be obtained by tak- 
ing the development of a reaction-diffusion equation for a time r. Then 
q(") gives the development to time nr, and the results for this discrete 
formulation transfer to such equations. 



Specialising Weinberger (2002 Theorem 2.1), there is a spreading 
speed r in the following sense. If u^^\x) = for a; > L and u'^°\x) > 
(5 > for all x < K, then for any e > 



sup 

x>n{r+<i) 



,(«) 



(a;)| and 



sup 

x<n{r- 







|w(")(a;) -/3| ^ 0. 



(6.1) 



In some cases the spreading speed can be computed through linearisation 
— see Weinberger ( 2002 Corollary 2.1) and Lui ( 1989a Corollary to The- 



orem 3.5) — in that the speed is the same as that obtained by replacing Q 
by a truncation of its linearisation at the zero function. So Q{u) = Mu 
for small u and Q{u) is replaced by min{w, Mu}, where a; is a constant, 
positive function with Muj > cj. The linear functional Mu{y) must be 
represented as an integral with respect to some measure, and so, using 
the translation invariance of M , there is a measure /i such that 



Mu{y) = J uiy-z)^iidz). (6.2) 
Let k{9) = log J e^^fi{dz). Then the results show that the speed T in 

m 



(6.1 1 is given by 



r = inf 



(6.3) 



Formally, this is one of the formulae for the speed in Theorem 3.1 In 
fact, the two frameworks can be linked, as indicated next. 

In the BRW, suppose the generic reproduction process Z has points 
{zi}. Define Q by 



'{u{x)) = 1 - E 



1 - ]^U(.T - Zi 



This has the general form described above with /3 = 1. On taking 
u^^\x) = P{B^^'> > x) (i.e. Heaviside initial data) it is easily estab- 

= P(_B(") > x). This is in essence the 



lished by induction that u 
same as the observation that the distribution of the rightmost particle in 



branching Brownian motion satisfies the differential equation (3.6). The 



idea is explored in the spatial spread of the 'deterministic simple epi- 



demic' in Mollison and Daniels (1993), a continuous-time model which. 
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like branching Brownian motion, has BRW as its discrete skeleton. Now 
Theorem 3.1 implies that (6.1) holds, and that, for Q obtained in this 
way, the speed is indeed given by the (truncated) linear approximation. 
The other theorems about i?*^"^ also translate into results about such 

gives conditions for u^"^ when centred 



5.5 



Q. For example. Theorem 
suitably to converge to a fixed (travelling wave) profile. 



7 The multitype case 

Particles now have types drawn from a finite set, 5, and their reproduc- 
tion is defined by random points in 5xM. The distribution of these points 
depends on the parent's type. The first component gives the daughter's 
type and the second component gives the daughter's position, relative 
to the parent's. As previously, Z is the generic reproduction process, 
but now let be the points (in K) corresponding to those of type cr; 

and Z1^ are defined similarly. Let and Ei, be the probability and 
expectation associated with reproduction from an initial ancestor with 

(n) 

type v Q S. Let be the rightmost particle of type a in the nth 

generation, and let i?^"-* be the rightmost of these, which is consistent 
with the one-type notation. 

The type space can be classified, using the relationship 'can have a 
descendant of this type', or, equivalently, using the non-negative expec- 
ted family-size matrix, 'E^Z^-i^)- Two types are in the same class when 
each can have a descendant of the other type in some generation. When 
there is a single class the family-size matrix is irreducible and the process 
is similarly described. When the expected family-size matrix is aperiodic 
(i.e. primitive) the process is also called aperiodic, and it is supercrit- 
ical when this matrix has Perron-Frobenius (i.e. non-negative and of 
maximum modulus) eigenvalue greater than one. Again, to avoid qual- 
ifications about the survival set, assume extinction is impossible from 
the starting type used. 

For 6 > 0, let exp{K{9)) be the Perron-Frobenius eigenvalue of the 
matrix of transforms J e^^Ei,Za{dz), and let K{d) = oo for 6* < 0. If 
there is just one type, this definition agrees with that of k at the start of 
^ The following result, which is |Biggins (1976a Theorem 4), has been 
mentioned already. 



Theorem 7.1 Theorem |3.1| holds for any initial type in a supercritical 
irreducible BRW. 
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The simplest multitype version of Theorem 3.2 is the following, which 
(2009). When a = v \i is a, special case of results 



Biggins 


(2009 


Biggins 


(1997 



Theorem 7.2 For a supercritical aperiodic BRW for which (3.2) holds, 
-log (Z(")[na,( 



,oo. 



(7.1) 



for a 7^ sup{a : k* [a) < 0} = F, and 



B, 



(") 



a.s.-P^. 

n 

Again there is a deterministic theory, following the pattern described in 
^and discussed in Lui ( 1989a|b ), which can be related to Theorem |7.1[ 
Recent developments in that area raise some interesting questions that 
are the subject of the next two sections. 



8 Anomalous spreading 



In the multitype version of the deterministic context of ^ recent papers 
(IWeinberger et all 120021 120071 ILewis et all 120021 ILi et al.l [20051) have 



considered what happens when the type space is reducible. Rather than 
set out the framework in its generality, the simplest possible case, the 
reducible two-type case, will be considered here, for the principal issue 
can be illustrated through it. The two types will be v and 77. Now, the 
vector-valued non-negative function u*^"^ gives the population density 
of two species — the two types, v and 77 — at a:; G K at time n, and Q 
models growth, interaction and migration, as the populations develop in 
discrete time. The programme is the same as that indicated in f|6j that 
is to investigate the existence of spreading speeds and when these speeds 
can be obtained from the truncated linear approximation. 

In this case the approximating linear operator, generalising that given 



in (6.2), is 



{Mu{y)),, = / u,,{y ~ z) fi^r,{dz), 



Uy{y - z) ^luvidz) + / Uniy ~ z) y.ur,{dz) 



{Mu{y)), 



Simplifying even further, assume there is no spatial spread associated 
with the 'interaction' term here, so that ^ u^(y — z) ^^j^{dz) = cuj^{y) 
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for some c > 0. The absence of jiriv in the first of these makes the hnear 
approximation reducible. The first equation is really just for the type 
T] and so will have the speed that corresponds to /x^^, given through 



its transform by (6.3 1, and written F,,. In the second, on ignoring the 
interaction term, it is plausible that the speed must be at least that of 
type V alone, which corresponds to and is written F^. However, it 
can also have the speed of Uri from the 'interaction' term. It is claimed 
in Weinberger et al. (2002 Lemma 2.3) that when Q is replaced by the 



approximating operator minjo;, Mu\ this does behave as just outlined, 
with the corresponding formulae for the speeds: thus that of rj is F^ 
and that for v is max{F^,Fi,}. However, in Weinberger et al. (2007) a 



flaw in the argument is noted, and an example is given where the speed 
of v in the truncated linear approximation can be faster than this, the 
anomalous spreading speed of their title, though the actual speed is not 
identified. The relevance of the phenomenon to a biological example is 
explored in 'Weinberger et al. ( 2007[ §5). 

As in ^ the BRW provides some particular examples of Q that fall 
within the general scope of the deterministic theory. Specifically, suppose 
the generic reproduction process Z has points {ai,Zi} £ S xR. Now let 
Q, which operates on vector functions indexed by the type space S, be 
defined by 



1-E. 



1 - ]^Ucr,(a; - Zi) 



Then, just as in the one-type case, when u''^\x) ~ Py(i?^°'' > x) in- 
duction estabhshes that u^\x) — (S^") > x). It is perhaps worth 
noting that in the BRW the index i/ is the starting type, whereas it is 
the 'current' type in Weinberger et al. (2007). However, this makes no 
formal difference. 

Thus, the anomalous spreading phenomenon should be manifest in the 
BRW, and, given the more restrictive framework, it should be possible 
to pin down the actual speed there, and hence for the corresponding Q 
with Heaviside initial data. This is indeed possible. Here the discussion 
stays with the simplifications already used in looking at the deterministic 
results. 

Consider a two-type BRW in which each type v always produces at 
least one daughter of type i', on average produces more than one, and 
can produces daughters of type rj — but type rj never produce daughters 
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of type V. Also for 6 >Q\et 



K^{e)=\og j e^'E^Z^idz) and k^(6') = log J e^^E^Z,,(dz) 



and let these be infinite for 9 < 0. Thus Theorem 3.2 applies to type ly 
considered alone to show that 



- log (^Zl"^[na, oo)j -K*(a) 



It turns out that this estimation of numbers is critical in establishing 
the speed for type rj. It is possible for the growth in numbers of type v, 
through the numbers of type rj they produce, to increase the speed of 
type rj from that of a population without type v. This is most obvious if 
type r/ is subcritical, so that any line of descent from a type r/ is finite, 
for the only way they can then spread is through the 'forcing' from type 
v. However, if in addition the dispersal distribution at reproduction for 
r] has a much heavier tail than that for v it is now possible for type rj to 
spread faster than type v. 

For any two functions / and g, let £[/, 5] be the greatest (lower semi- 
continuous) convex function beneath both of them. The following result 



is a very special case of those proved in Biggins (2009). The formula 



given in the next result for the speed is the same as that given in 



Weinberger et al. (2007 Proposition 4.1) as the upper bound on the 



speed of the truncated linear approximation. 

Theorem 8.1 Suppose that max{Ki^(0,y), k,,((/)^)} is finite for some 
(f'ri 1^ 4>i' > and that 



J e'^'-E^Z^{dz) < 00 ¥6* > 0. 
Let r — €[kI, K*]° . Then 

^ log (Z^")[na,oo)) ^ -r{a) a.s.-P^ 
for a ^ sup{a : r(a) < 0} = F^, and 



.1) 



B 



(") 



Furthermore, 



F^ a.s.-P,, 



rt^ inf nraxj^,^ 



(8.2) 



(8.3) 



•4) 
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From this result it is possible to see how can be anomalous. Suppose 
that r(r''') — 0, so that F''' is the speed, and that r is strictly below 
both and at F^^. This will occur when the minimum of the two 
convex functions k* and k* is not convex at F^, and then the largest 
convex function below both will be linear there. In these circumstances, 
K*(Ft) > 0, which implies that K*(Ft) = oo, and ^^(Ft) > 0. Thus F^ 
will be strictly greater than both Fi, and F^, giving a 'super-speed' — 
Figure [O] illustrates a case that will soon be described fully where T^, 
and F^ are equal and F^ exceeds them. Otherwise, that is when F^ is 
not in a linear portion of r, F^ is just the maximum of F^ and F,,. 



The example in Weinberger et al. (2007) that illustrated anomalous 



speed was derived from coupled reaction-diffusion equations. When there 
is a branching interpretation, which it must be said will be the exception 
not the rule, the actual speed can be identified through Theorem |8 . 1 1 and 
its generalisations. This will now be illustrated with an example. Suppose 
type rj particles form a binary branching Brownian motion, with variance 
parameter and splitting rate both one. Suppose type v particles form a 
branching Brownian motion, but with variance parameter V , splitting 
rate A and, on splitting, type v particles produce a (random) family of 
particles of both types. There are 1 -I- Ni, of type v and iV^ of type ry, 
so that the family always contains at least one daughter of type v; the 
corresponding bivariate probability generating function is 'Eio^^^^b^^ — 
af{a,b). Let v{x,t) = P,,(B(*) < x) and w{x,t) = P^(B(*) < x). These 
satisfy 



dv 1 d^v 
di " 29^2 ~ 
dw 1 d'^w 
Ik " 2d^ 



v{l - w), 

- Au;(l - f{w,v)). 



Here, when the initial ancestor is of type u and at the origin the initial 
data are w{x, 0) = 1 for a; > and otherwise and v{x, 0) = 0. Note 
that, by a simple change of variable, these can be rewritten as equations 
in P^(i?*^*-' > a;) and Pi,{B'^*''> > x) where the differential parts are 
unchanged, but the other terms look rather different. 

Now suppose that a/(a, 6) = a'^{l—p+pb), so that a type v particle al- 
ways splits into two type v and with probability p also produces one type 
7]. Looking at the discrete skeleton at integer times, K,y{0) = V9'^ /2 + X 
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for 6* > 0, giving 



-A 



a < 



<(«) = 



la" 



and speed {2V\y^^, obtained by solving K*(a) = 0. The formulae for 
K* are just the special case with V = X = 1. Now, for convenience, take 
V = A~^, so that both types, considered alone, have the same speed. 
Then, sweeping positive values to infinity. 



{-A a<0, 
oo a>2i/2. 
Now is the largest convex function below this and k*. When 



A = 3 these three functions are drawn in Figure 8.1 




Key: solid — a* (with A = 3); dotted — k*; dasliod — e[«J, h* 
Speeds: r„ = T, = = 1.41 . . .; Pt = 4/x/6 = 1.63 . . .. 



Figure 8.1 Illustration of how anomalous speed arises. 



The point where each of them meets the horizontal axis gives the value 
of speed for that function. Thus, F^ exceeds the other two, which are 
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both %/2. Here = 4/^6. In general, for A > 1, it is (l + A)/\/2A, which 
can be made arbitrarily large by increasing A sufficiently 



9 Discussion of anomalous spreading 



The critical function in Theorem 



8.1 



is r = Here is how it 

arises. The function k® describes the growth in numbers and spread of 
the type v. Conditional on these, describes the growth and 

spread in expectation of those of type rj. To see why this might be 
so, take a b with < so that (|3.5[) describes the exponential 



growth of zi"^\mb, oo): there are roughly exp(— mK*(6)) such particles 
in generation m. Suppose now, for simplicity, that each of these pro- 
duces a single particle of type ry at the parent's position. As noted 



just before Theorem 3.2 the expected numbers of type rj particles in 
generation r and in [rc, oo) descended from a single type rj at the ori- 
gin is roughly exp(— tk* (c)). Take A € (0,1) with m = An and r = 
(1 — A)n. Then, conditional on the development of the first m gener- 
ations, the expectation of the numbers of type rj in generation n and 
to the right of mb + rc — n{\b -f- (1 — A)c) will be (roughly) at least 
exp(— n(AK*(6)-|-(l — A)k*(c))). As 6, c and A vary with A5-|-(l — A)c — a, 
the least value for Ak* (6) + (1 — A)k* (c) is given by , K^](a). There is 
some more work to do to show that this lower bound on the conditional 



expected numbers is also an upper bound — it is here that (8.11 comes 
into play. Finally, as indicated just before Theorem |3. 2 [ this corresponds 
to actual numbers only when negative, so the positive values of this 
convex minorant are swept to infinity. 

When the speed is anomalous, this indicative description of how r = 
arises makes plausible the following description of lines of 
descent with speed near They will arise as a 'dog-leg', with the first 
portion of the trajectory, which is a fixed proportion of the whole, being 
a line of descent of type 77 with a speed less than F^. The remainder is 
a line of descent of type with a speed faster than F^ (and also than 
Ft). 

Without the truncation, the linear operator approximating (near u = 
1) aQ associated with a BRW describes the development of its expected 
numbers, and so it is tempting to define the speed using this, by look- 
ing at when expected numbers start to decay. In the irreducible case. 
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Theorem 7.2 has an analogue for expected numbers, that 
-log(E,Z(")[na,oo)) ^ -K*(a), 



and so here the speed can indeed be found by looking at when expected 



numbers start to decay. In contrast, in the set up in Theorem 8.1 

^log(E,4")[na,(^)) ^ -£[<,«;] (a), 

and the limit here can be lower than , k*] — the distinction between 
the functions is whether or not positive values are swept to infinity in 
the first argument. Hence the speed computed by simply asking when 
expectations start to decay can be too large. In Figure 



.1 



is 

the same as , k*], but it is easy to see, reversing the roles of k* and 
K*, that is the same as k*. Thus if rj could produce z/, rather 

than the other way round, expectations would still give the speed but 
the true speed would be Tjj{— F,,). 

The general case, with many classes, introduces a number of addi- 
tional challenges (mathematical as well as notational). It is discussed in 



Biggins ( 2009 1 . The matrix of transforms now has irreducible blocks on 
its diagonal, corresponding to the classes, and their Perron-Frobenius 
eigenvalues supply the k for each class, as would be anticipated from 
^ Here a flavour of some of the other complications. The rather strong 



condition (8.1) means that the spatial distribution of type -q daughters 
to a type v mother is irrelevant to the form of the result. If convergence 
is assumed only for some > rather than all this need not remain 
true. One part of the challenge is to describe when these 'off-diagonal' 
terms remain irrelevant; another is to say what happens when they are 
not. If there are various routes through the classes from the initial type 
to the one of interest these possibilities must be combined: in these cir- 



cumstances, the function r in (8.2) need not be convex (though it will 



be increasing) . It turns out that the formula for F^ , which seems as if it 
might be particular to the case of two classes, extends fully — not only 
in the sense that there is a version that involves more classes, but also 
in the sense that the speed can usually be obtained as the maximum of 



that obtained using (8.4) for all pairs of classes where the first can have 
descendants in the second (though the line of descent may have to go 
through other classes on the way). 
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